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ABSTRACT 

One possible diagnostic of planet formation, orbital migration, and tidal evolution is the angle ip 
between a planet's orbital axis and the spin axis of its parent star. In general, ij) cannot be measured, 
but for transiting planets one can measure the angle A between the sky projections of the two axes 
via the Rossiter-McLaughlin effect. Here, we show how to combine measurements of A in different 
systems to derive statistical constraints on -0. We apply the method to 11 published measurements 
of A, using two different single-parameter distributions to describe the ensemble. First, assuming a 
Rayleigh distribution (or more precisely, a Fisher distribution on a sphere), we find that the peak 
value is less than 22° with 95% confidence. Second, assuming a fraction / of the orbits have random 
orientations relative to the stars, and the remaining fraction (1 — /) are perfectly aligned, we find 
/ < 0.36 with 95% confidence. This latter model fits the data better than the Rayleigh distribution, 
mainly because the XO-3 system was found to be strongly misaligned while the other 10 systems 
are consistent with perfect alignment. If the XO-3 result proves robust, then our results may be 
interpreted as evidence for two distinct modes of planet migration. 

Subject headings: celestial mechanics — planetary systems — stars: rotation — methods: statistical 



1. INTRODUCTION 

In a planetary system, the angle ip between the or- 
bital axis and the stellar rotation axis may provide clues 
about the processes that sculpt planetary orbits. As 
an example, the angle between the Sun's rotation axis 
and the north eclip tic pole is Vo = 7.15 deg (see, e.g., 
iBeck fc Giles! [20051 ) ■ The smallness of V'Oj along with 
the small mutual inclinations between planetary orbits, 
is prima facie evidence for formation in a spinning disk. 
The smallness of ipQ has also been used to constrain the 
properties of any "Plane t X" or solar companion star 
(|Goldreich fc War"dlll972D , and t o place upper bou nds on 
violations of Lorentz invariance (lNordtvedtlll98l . That 
ipQ is not even closer to zero has been interpreted as 
evidence for an early close encounter with another star 
(|Helleijll993[) or a non-axisymmetric, "twisting'' colla pse 
of the Sun's parent molecular cloud (|Tremainelll99lD . 

For exoplanets, it has been recognized that ip \s a. 
possible diagnostic of theories of planet migration. 
Some of the mechanisms that have been proposed 
to produce close-in giant planets \yould preserve an 
initial spin-orbit ali gnment ([Lin et al.lll996t [Warcilll997l : 
iMurrav et al.l I1998D . while others w ould produce at 
least o ccasionally large misalignments (Ford et al."2001; 
Yu fc Trcmain c 2001: Papaloizou fc Terquem 200ll: 
Terqucm fc Pa paloizoull2002t iMarzari fc Weidenschillina 



20021: .Thommes fc Lissaue: 



Fabrvckv fc Tremaind 1200 



I 



I2OO3F IWu et al 
iChatteriee et al 



2007 



2008 



Juric fc Tremaind l2008t iNagasawa et al.l 12008 "). Tides 



raised on the star are not expec ted to play a major role 
in altering ip ()Winn et al.l 120051 ). but it is possi ble that 
coplanarization is more efficient than expected (|Mazehl 
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l2008l : lPonil2008f ). For example, if a hot Jupiter migrated 
inward before its host star contracted onto the main 
sequence, the distended stellar envelope could produce 
more pronounced tidal effects (jZahn fc Bouchetl Il989l : 
iDobbs-Dixon et al.l[200l . 

Independent of the interpretation, the angle '0 is a fun- 
damental geometric property, and for this reason alone 
it is worth seeking empirical constraints on ip whenever 
possible. We regard ip to be on a par with the semima- 
jor axis and the eccentricity: all of them are basic orbital 
parameters for which accurate and systematic measure- 
ments can lead to revealing discoveries and statistical 
constraints on exoplanetary system architectures. 

For a generic exoplanet discovered by the Doppler 
method, no information about spin-orbit alignment is 
available. For transiting exoplanets, one may exploit 
a spectroscopic phenomenon known as the Rossiter- 
McLaughlin (RM) effect. During a transit, the planet 
hides part of the rotating stellar disk and causes the 
stellar spectral lines to be slightly distorted. The dis- 
tortion is usually manifested as an "anomalous" Doppler 
shift of order /S.V — -{Rp/ Rs)'^Vp, where Rp/Rs is the 
planet-to-star radius ratio, and Vp is the projected rota- 
tion rate of the hidden portion of the stellar photo sphere 
(|Ohtaet al.ll2005l : lGim^nez|[200l iGaudi fc Wi^ [2007'). 
Because photometric observations give a precise and in- 
dependent measure of (Rp/Rs)'^, spectroscopic monitor- 
ing of AV reveals Vp{t), thereby allowing one to chart 
the planet's trajectory relative to the sky-projected stel- 
lar rotation axis. 

An important limitation of the RM technique is that 
it is sensitive only to the angle A between the sky projec- 
tions of the orbital and rotational axes.** We refer to ip as 

Strictly speaking, the RM signal depends more directly on the 
angle A' between the transit chord and the sky-projected stellar 
rotation axis. For an eccentric orbit this angle may differ from 
the angle A between the sky projections of the orbital axis and the 
stellar rotation axis. It is straightforward to relate A to A' when the 
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TABLE 1 
Summary of RM measurements 



Exoplanot Projected spin-orbit angle A [deg] References 



HD 189733b 


-1.4± 1.1 


1 


HD 209458b 


0.1 ±2.4 


2,3,4,5,6* 


HAT-P-lb 


3.7 ±2.1 


7 


CoRoT-Exo-2b 


7.2 ±4.5 


8 


HD 149026b 


1.9 ±6.1 


9,6* 


HD 17156b 


9.4 ±9.3 


10,11* 


TrES-2b 


-9.0 ±12.0 


12 


HAT-P-2b 


1.2 ± 13.4 


13*, 14 


XO-3b 


70.0 ± 15.0 


15 


WASP-14b 


-14.0 ±17.0 


16 


TrES-lb 


30.0 ±21.0 


17 



Not e. — References: (1) 'Winn et al.l (f2009); (2) 'Queloz et alj 
1I2OOOI'): ('3')lB undv & Marcv (2000); (4) Wittenmyer et al. (2005); 

t) 'Winn etal. (2005); (6) Winn & Johnson (in pre p.); (7) 
hnson et al. (2008); (8) Bouchy et al. (2008); (9) W olf et al.l 
007); (10) Narita et al. (2008); (11) Cochran et al. (200 g); (12) 
IWinn et al.l (2008): (13) Winnetal. (2007b); (14) Loeill e"t et al'l 
II2OO8I); (15 ) iHebrard et al, (2008 ); (16) Joshi et al.. (.2009i) ; (17) 
INaritact al.l (120071) . Where multiple references are given, the 
quoted result is taken from the starred reference. 

the spin-orbit angle, and to A as the projected spin-orbit 
angle. In general the line-of-sight component of the stel- 
lar rotation axis is unknown. When |A| is small, then |A| 
is a lower limit on -0. (The situation is a bit more com- 
plex for large |A|, as will be shown in this paper.) While 
the finding of a large value of A implies that there is a 
large spin-orbit misalignment, with consequent implica- 
tions for the system's dynamical history, the finding of a 
small value of A has a more ambiguous interpretation. 

The way to overcome this limitation and draw gen- 
eral inferences about spin-orbit alignment is to con- 
sider the ensemble of RM results. The situation is 
similar to the early days of Doppler planet detection. 
Doppler measurements give only A/pSinio, where Mp 
is the planet's mass and io is the orbital inclination. 
When there were only a few detections, it was impos- 
sible to draw firm conclusions about the mass distribu- 
tion of the planets, or even to be completely certain that 
they were planets and not brown dwarfs in fa ce-on orbits 
(|Mazeh et al.lll998t IStepinski fc BlacS |2000D . However, 
once tens of systems were known with precise measure- 
ments of M p sin in, the pla.net ary mass distribution came 
into focus (Jorissen et al.' ' 20011 IZucker fc Mazehl |2001| : 
[Tabachnik fc Tremaine 2004 ), under the reasonable as- 
sumption that the orbits are randomly oriented in space. 

There are now 11 exoplanetary systems for which RM 
measurements have been reported. The results are sum- 
marized in Table [TJ The time is ripe to undertake an 
analogous study of the statistical constraints on spin- 
orbit alignment. It is worth drawing attention to the en- 
tries for HD 209458 and HD 149026, for which we are us- 
ing updated determinations of A by Winn & Johnson (in 
preparation). For HD 209458, the revision is due to an 
improved analysis method taking into account correlated 
errors in the radial-velocity data. For HD 149026, a bet- 
ter transit light curve led to enhanced precision in A. 

The results in Table 1 are easily summarized: each in- 
orbital eccentricity and argument of pericenter are known from the 
Doppler orbit of the star. For the systems considered in this paper, 
the maximum difference between A and A' is approximately 2° (for 
HAT-P-2) and is in all cases much smaller than the measurement 
uncertainty in A. 



dividual system besides XO-3 is consistent with perfect 
spin-orbit alignment within 2a. However it is not obvi- 
ous what exactly is ruled out by these results, or what 
we may conclude about the "typical" value of -0 among 
the transiting planets. The purpose of this paper is to 
provide a statistical framework for understanding statis- 
tical constraints on spin-orbit alignment that follow from 
RM observations, and apply it to the current data. We 
are primarily concerned with the empirical information 
about the distribution of ■;/', rather than the interpreta- 
tion in terms of migration theories or tidal effects, which 
will be the subject of future studies. 

This paper is organized as follows. The geometry of 
this problem is defined in § [21 The relevant probability 
distributions for individual systems are derived in § |3l 
Constraints on based on an ensemble of RM observa- 
tions are discussed in § 2] § [S] gives a summary of the 
results, a discussion of some limitations of our analysis, 
and some suggestions for future work. 

2. SPHERICAL GEOMETRY OF THE 
ROSSITER-MCLAUGHLIN EFFECT 

Let rio and rij denote the unit vectors in the directions 
of the orbital angular momentum and stellar rotational 
angular momentum, respectively. The angle between no 
and ris, is the "spin-orbit angle," denoted 0. This is pre- 
sumably the only angle of intrinsic physical significance 
in this problem, possibly bearing information about the 
initial condition for planet formation, the endpoint of 
planet migration, or the result of tidal evolution. How- 
ever, if) is not directly measurable, and we must introduce 
some other angles. 

Figure 1 shows two useful coordinate systems. In the 
"observer-oriented" coordinate system shown in the left 
panel, Z points at the observer, X points along the line 
of nodes of the planetary orbit, and Y completes a right- 
handed triad. The ascending node of the planet (the 
location where the planet pierces the sky plane with 
Z > 0) is at X < 0. In this coordinate system, rio 
is in the Y Z plane and is specified by the inclination 
angle io = arccos(no • Z)^ which ranges from to tt. 
Specifying ns requires two angles, the inclination angle 
is — arccos(ns • Z) and an azimuthal angle, which by the 
convention of Ohta, Taruya, fc Suto (2005) we take to 
be A, measured clockwise on the sky from the F-axis to 
the sky projection of ng. In summary, 

no — Ysinio + Zcofiio (1) 

TUs— X sin is sin A + y sin is cos A + Z cos is- (2) 

For a transiting planet, io is measurable via transit 
photometry, A (the "projected spin-orbit angle") is mea- 
surable via the RM effect. Usually there is no direct 
measurement of i^, altho ugh it is possible to co nstrain 
is via asteroseismology (jGizon fc Solankil [Mil or by 
combining estimates of the stellar radius, stellar rota- 
tion period, and p rojected rotational velocity (see, e.g., 
|W inn et alll2007a| ). By symmetry, a configuration (io,A) 
cannot be distinguished from a different configuration 
(tt — io, —A). Because of this degeneracy we restrict io to 
the range [0,7r/2], and allow A to range from — tt to +7r. 
A positive (negative) value for A means that, from the 
observer's perspective, the projected stellar spin axis is 
rotated clockwise (counterclockwise) with respect to the 
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projected orbit normal. Values of |A| greater than vr — 
correspond to retrograde orbits. 

In the "orbit-oriented" coordinate system X'Y'Z' 
shown in the right panel of Figure 1, we define Y' = Uq. 
This system is related to XYZ by a rotation of 7r/2 — io 
about the X axis. We define ip and as the polar and 
azimuthal angles of in this system, viz., 

Us — X' sin ip sin + Y' cos ip — Z' sin ip cos il. (3) 

Equation ([2|) may also be rewritten using the rotation 
transformation equations 

X' = X, (4) 

Y' = Y sinio + Z cosio, (5) 

Z' = —Y cosio + Z sinio, (6) 

giving 

Hs —X'sinis sinA (7) 

+Y' (sin is cos A sin io + cos ig cos io) + 

+Z'{co8is sinio — sinig cos A cosio). 

Setting the components of equation ^ equal to those 
of equation ^ we obtain three relations 

sin is sin A = sin sin O (8) 

cos ip — sin is cos A sin io -I- cos is cos io (9) 

sin'0cosi7 = sinis cos A cosio — cosi^ sinio, (10) 

which will be used in the following sections to derive 
constraints on ip based on measurements of io and A and 
on reasonable assumptions regarding is and Q. 

3. GIVEN V, WHAT WILL RM OBSERVATIONS SHOW? 

Suppose an observer has measured the orbital incli- 
nation of a planetary system to be io and is about to 
measure the RM effect. If the spin-orbit angle of the 
system is ip, then what is the probability distribution 
for the projected spin-orbit angle A that the observer 
will measure? In this section we calculate this function, 
p{X\ip,io), which will play an important role in the cal- 
culations to follow. 



We assume that for a given -0, the probability distri- 
bution of the azimuthal angle Q is uniformly distributed 
between — tt and -t-7r. This is self-evident for a circular 
orbit, as there is no physical reason to distinguish any 
particular azimuth. For an eccentric orbit it is conceiv- 
able that fl is correlated with the direction of pericenter, 
but this possibility seems unlikely for hot Jupiters, be- 
cause the torque exerted on the stellar rotational bulge 
by the planetary orbit will cause O to precess. The pre- 
cession period for a hot Jupiter orbit is much shorter 
than the age of the system, and the secular evolution of 
both the argument of pericenter and the longitude of the 
ascending node — both defi ned with re spect to the stellar 
equator — is linear in time ()Rovl 120051 : §11.4.2, to lowest 
order in stellar shape parameters). We therefore expect 
an ensemble of stellar spins to have a uniform distribu- 
tion in ri, even if their orbits are eccentric. Conversely, 
the spin-or bit angle i p remains constant over the preces- 
sion cycle (lRovll2005l: §11.4.2). 

To derive p(A|'0, io), we first express A in terms of ip, 
io, and O by eliminating is from equations ([8|)- (fT0|) : 



X{ip, io, ^) — arctan 



sin ip sin f2 



cos "0 sin io -t- sin tp cos il cos io 



(11) 

Since X{—fl,xl:,io) = —X{fl,^p,io), to calculate probabil- 
ities of A and ip we need only consider n and A in the 
range [0,7r]. The results will apply to negative values of 
A as positive and negative values occur with equal prob- 
ability. 



Next, making use of p{fl) 
ables from to A: 



, we transform vari- 



N 



j{X\tp,io) = ^p{ni\tp,io) 



dn 



dX 



1 ^ 



dn 
Ix 



(12) 

where the sum ranges over the N solutions Qi of equa- 
tion PT|) . for a given choice of A, tp, and io- We now find 
those solutions. 
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Letting u = cosil, equation (jlip is equivalent to a quadratic equation in u: 

1? [sin^ V(l + tan^ A cos^ io)] + 2u [tan^ A sin cos V' sin io cos «o] — svc? ip co^ io- (13) 
When ip < io and sin A < sinip/ sinio, this equation has two roots: 



^ — tan^ A cos ^ sin io cos io ± sec A^/ sin^ — sin^ io sin^ A 
sini/'(l + tan^ A cos-^ «o) 

Fig. 2a shows the geometrical interpretation of this two-root case. The ellipse on the sky that is traced out by 
as takes on all values (the "n^ ellipse" ) has two intersections with the ray that corresponds to the given value of 
A (the "A ray"). When ip < io and sin A > sin-0/ sinio, there are no real roots of equation (fT4| . Geometrically, the 
Us ellipse is too small to intersect with the A ray. The last case is io < "0 ^ ~ *o, for which equation (fT4|) has one 
real root. Fig. 2b shows an example of this case. The Ug ellipse encloses the origin, and there is one intersection 
point with each A ray. This single root corresponds to fii in equation (fT4|) . Thus in equation (fT2|) . = 0, 1, or 2, 
depending on the values of ip, io, and A. 

Because of the derivatives \di}/dX\ in equation (fT^ it is easier to derive the cumulative distribution P{X < Xc\ip,io) 
than to solve directly for p{X\ip, io)- The results are: 

1 + -^{^1 — fl2), Ip < io and Ac < arcsin(sin i/;/ sin io) 
1, Ip < io and Ac > arcsin(sin i/)/ sin io) 

P(A<Acl0,Zo)= <( ^, io<lP<7T-to } (15) 

i(r2i — ^2)-, Ip > TT — io and Ac > tt — arcsin(sin'0/ sin io) 

0, Ip > TT — io and Ac < vr — arcsin(sin'0/ sin io) 

where fli^2 from equation are evaluated at Ac, ip, io- 

The probability densities are obtained by differentiation. First we evaluate the derivatives dfli/dX: 



d^li,2 ( 2tanAsec^A 



dX Vsinrii.2 sin-0(l — tan^ Acos'^ 1 



cos A(sin^ io — sin^ ip) 
2(sin^ Ip - sin^ io sin^ A)i/2 



I I - - I 9 i^UD /\ 1 0111 to — 0111 'i^ ) \ /-, /i\ 

X ( cos -0 sm Zo COS lo + sm 7/) cos ZoCOSiji^2 2 / ^"2" — ^TTTTTo)- ^^^) 



Finally, we calculate p(A|V', io) by inserting these derivatives into equation (|12p . The full expressions are too large 
to reproduce here; instead we give the expressions into which equation (fT6|) may be substituted: 



'^a" ~ ^if) ' ^^""^^ sinio and sinA < sinip/ sinio 



p{X\ip,io) = { 0, sin 1/) < sinio and sin A > sin?/;/ sinio } (17) 

sinV'> sinio 



For a transiting planet, io is always close to 7r/2. When io = 7r/2 exactly, the results are simplified as 

1/2-1 



PiX< Xo\ip,io^Tr/2) 



1 I \ — '^QQ V' 
sin \ cos-^ Ac 



|A-7r/2| > \iJj-tt/2\ 



1, ip<Xc< tt/2 

0, ^p> Xc> 7r/2 



(18) 



and 

/M,- /o^ /I — w .T'^ -2 1,1/2 , |A- V2| > |V^-7r/2| \ 
p(A|^,Zo^vr/2).|-os.(cos-.-cos > j,_^;2|< J-vr/2|| ^^^^ 

In the degenerate case io = 7r/2 and ip = 7r/2, A is observed to be either —tt/2 or tt/2 with equal probability. 
Figure [3] shows the probability densities and cumulative distributions for io — 90° and io = 80° and some represen- 
tative values of ip- To gain an intuitive appreciation of the results, consider an edge-on orbit with ip < 90°, shown 
in the left halves of the upper two panels. In this case the spin-orbit angle ip is an upper bound on its sky-projected 
version A. For ip = 30°, the chance of observing A to be smaller than i/; by a factor of two is approximately 35%. In 
contrast, for ip = 85°, the chance of observing A smaller than by a factor of 2 is only «5%. In this sense, A is a 
more faithful indicator of ip when ip is large. For non-edge-on orbits (the lower panels), the maximum value of A is 
increased, and for a non-edge-on orbit with ip near 90°, it is possible to observe any value of A. 
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Fig. 2. — Geometry of the roots of equation ([T4j. For given values of tp and io there are either 0, 1, or 2 possible values of A. In 
these panels the dashed line is the ellipse on the sky (the "ns ellipse") that is traced out as lis (thin vectors) sweeps around no (thick 
vector), with f2 taking on all values, and the dotted line is the ray corresponding to a given value of A (the "A ray"), (a) iji < io and 
sin A < sinij}/ sinio. There are two intersections of the ellipse and A ray. (b) io < tj) < tt — io- There is only one intersection. For < io 
and sin A > s'mip/ sinio (not shown), the ns ellipse is too small to intersect the A ray. 



Edge-On Orbit, i„ = 90° Edge-On Orbit, i„ = 90° 




X [deg] [deg] 



Fig. 3. — Probability distributions for the projected spin-orbit angle A conditioned on the true spin-orbit angle tp and the orbital 
inclination io- See Eqs. I I14I19I I. The upper panels show results for an edge-on orbit (io = 90°) and various choices of i/;. The lower panels 
show results for a nearly edge-on orbit {io = 80°) and various choices of ip. 
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3.1. Given A from RM observations, what may be 
inferred about ip ? 

Suppose an observer has just measured io and A for a 
particular transiting system. What may be reasonably 
inferred about the spin-orbit angle V'? We appeal to 
Bayes' theorem: 



p(V'|A, io) ocp(A|?/', jo)p(V'), 



(20) 



where p{\\il),io) was calculated in the previous section, 
and p{^) is the "prior" distribution, quantifying the ob- 
server's assumptions prior to the measurement. In this 
section we adopt a prior distribution p{ip) = sin ip, imply- 
ing that ris and rio are uncorrelated and is randomly 
oriented in space. This is the most uninformative or con- 
servative assumption, in the sense that if rio and are 
instead highly correlated (with consequent implications 
for the theory of planet migration or tidal evolution), this 
fact should be demonstrated based on the data, rather 
than assumed from the outset. Hence, p(V'|A,«o) may 
be obtained by multiplying equation (fT7| by sin?/; and 
renormalizing. For brevity, we give here only the ana- 
lytic results for the case of an edge-on orbit and A < 7r/2, 
measured with no error: 

r 0, V < A 

p('(/'|A,lo = 7r/2) = <^ cos »^ sin ^ W; > A 

L COS A(cos-^ A— cos-^ '0)^/^ ' ^ — 

.(21) 

and the corresponding cumulative probability function is 



P(V'<V'c|A,io = 7r/2) 



1 - 



0, 

COS^ Ipc 

cos-^ A 



1/2 




These results are plotted in Figure [4] for some represen- 
tative choices of A. When A is observed to be small, the 
a posteriori probability distribution of "0 has a very nar- 
row spike near A and extends broadly from -0 = A to 90°. 
When A is observed to be large, it is more likely that is 
close to A. Just as Mp cannot be constrained strongly by 
MpSmio for a Doppler-detected planet, we find that ip 
cannot be constrained strongly by A for an RM-detected 
planet, although the nature of the constraint is more 
complex in the latter case. In particular, for the edge-on 
case, it is possible to distinguish whether an orbit pro- 
grade or retrograde without ambiguity, even though the 
value of Tp is quite uncertain. 

As mentioned in § 2, in principle one may obtain the 
missing information about is via asteroseismology, or 
via the combination of estimates of the stellar radius 
Rs, projected rotation velocity usinig, and stellar ro- 
tation period Pg. Although asteroseismology has never 
been undertaken for a transiting exoplanetary system, 
the other rnethod has been employed fo r HD 189733 
(iWinn et al.l l2007at iHenrv fc Winnl]200l and CoRoT- 
Exo-2 (jBouchv et al and in both cases it was 

found that sini^ is consistent with 1 (i.e., the equator 
is edge-on). However near sinig = 1 a small error in the 
measured sini^ leads to a big error in ig- Therefore the 
data exclude only highly misaligned systems {ip <L 45°). 
In the calculations to follow regarding the entire ensem- 
ble, we chose not to make use of these constraints spe- 
cific to HD 189733 and Corot-Exo-2, for simplicity and 
because the extra information does not lead to signifi- 
cantly more powerful constraints. 
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Fig. 4. — Probability distributions for the true spin-orbit angle 
^ conditioned on the projected spin-orbit angle A, assuming an 
edge-on or bit ( ip = 90°) and random spin-orbit alignment. See 
equations l(2T] l-ll22ll. 



Even when the rotation period is not available, one 
may exclude very nearly pole-on configurations of the 
star because they would require the star to be rotating 
unrealistically rapidly. Certainly the rotation rate can- 
not exceed the breakup speed, although in practice we 
find that in realistic cases, applying this constraint does 
not make a perceptible difference in the distribution for 
ip. One might go further by applying an a priori con- 
straint on is to enforce agreement with the "typical" ro- 
tation rate for a star of the given spectral type and age. 
For the present study we chose not to apply any such 
constraint, to avoid complications due to the uncertain- 
ties in the stellar types and ages, the intrinsic scatter in 
rotation rates, and the possibility that the rotation rates 
of stars hosting close-in giant planets may be systemati- 
cally different from stars in general (due to tidal torques, 
earlier generations of "swallowed" planets, or other un- 
foreseen effects) . 

4. INFERENCES FROM AN ENSEMBLE OF SYSTEMS 

We have seen that ip cannot be tightly constrained in 
an individual system, even when A has been measured 
to within a few degrees, and even when sinig is con- 
strained by a measured rotation period, stellar radius, 
and projected rotation rate. The purpose of this section 
is to derive stronger constraints by combining the re- 
sults from d i fferen t systems. The first such attempt, by 
|W inn et all (|2006l ). demonstrated that the 3 measure- 
ments of A available at that time were strongly inconsis- 
tent with an isotropic distribution of spin-orbit angles. 
This conclusion has been strengthened with the addition 
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of many more systems with small values of |A|, and it 
is now clear that ns and no are correlated. The next 
natural question is: given the RM data, what is the dis- 
tribution of spin-orbit angles? For instance, (a) is there a 
"typical" value which describes the mean and dispersion, 
and (b) is a single smooth distribution a good descrip- 
tion of the data, or is there evidence for more than one 
population? 

To answer these question, we suppose that the 
spin-orbit angles of the systems under consideration 
were drawn from a probability distribution piip) (the 
"model"), and we us e the data to const rain the mathe- 
matical form oipiip). I Winn etldl (|2006l ) already showed 
that the isotropic model, pi = ^sinip, is untenable. A 
good theory of planet formation, migration, and evolu- 
tion should be able to supply p{ip), or at least its general 
form. We will not attempt to develop such a theory here. 
Instead we will use simple mathematical forms of p('0|a) 
with a few free parameters and derive the probability dis- 
tribution for those parameters, conditioned on the data. 

Let the model parameters form a vector a. The data 
consist of measurements of A and io for A^s = 11 sys- 
tems. These "measurements" are themselves probability 
distributions for A and io- We neglect the error in io, 
and denote by Pobs,fe(A) the probability distribution for 
A based on the observations of system k (from Table [1]) . 
We approximate the measurements as Gaussian distribu- 
tions with the quoted A and a\ as the mean and standard 
deviation.^ The model p(V'|a) implies a certain probabil- 
ity distribution for A, given by 

p'(AKo,a)=/ p{X\i^,io)pma.) d^, (23) 

JQ 

where p{X\ip,io) is given by equation (|17p . In practice 
this integral is problematic because it integrates over the 
singularities visible in Figure O but we found that the 
singularity handlers in Mathematica are able to perform 
the integral numerically. For edge-on orbits {io — 7r/2), 
the simplified version of p{X\ip,io = 7r/2) given by equa- 
tion (I2ip is applicable, and the transformation 

removes the singularity. This provided a useful check on 
the ability of Mathematica to handle the singularities; for 
io — 7r/2 the numerical integrals were identical whether 
or not the transformation of equation (|24p was employed. 
We may write the conditional probability as 

p(data|a) = ]J / pobs,fc(A)p'(A|ip,fc, a)dA, (25) 

and then use Bayes' theorem, 

p(a|data) oc p(data|a)p(a), (26) 

^ In some cases, even when the radial- velocty measurement errors 
are Gaussian, the posterior distribution Pobs,fc(^) is not Gaussian. 
This is especially true of systems with slow stellar rotation rates 
or small transit im pact parameters [see, e.g ., the TrES- 2 system 
Winn et all ll2008l) or the HAT-P-2 system Iwlnn et al.l l l2007bl) : 
Loeillet et al.l 1120081 )]. We investigated the sensitivity of our results 
on the assumption of a Gaussian distribution in A by using the ac- 
tual posterior distribution for A whenever we had enough informa- 
tion to compute it. We found that the ensemble results were not 
significantly affected, because the most non-Gaussian cases were 
those with large errors, which were already downweighted in the 
Bayesian analysis. 
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Fig. 5. — The Fisher probability distribution (see eq. 1271 ). for 
some representative values of the concentration parameter k. For 
K — > the Fisher distribution becomes an isotropic distribution 
(^sini/i), and for k ^ oo and ■0^0 the Fisher distribution be- 
comes a Rayleigh distribution with width parameter a = 
The displayed curves have k = 0.0, 0.4, 0.9, 1.6, 3.1, 7.1, and 29. 
These values were chosen because our prior, p{k) oc (1 + k^)"^''^, 
assumes equal probability for each interval between adjacent values 
of K. 



where p(a) is the prior probability density that is as- 
signed to the parameters a. Next, let us choose distribu- 
tions to test. 

4.1. A Fisher distribution 

If ip were a Cartesian coordinate instead of a polar 
angle, one might model piip) as a Gaussian distribution 
with zero mean and variance tr, and derive the probabil- 
ity distribution for a conditioned on the data. In the the- 
ory of directional statistics, the function that plays the 
same widespread and s uitably gener ic role as the Gaus- 
sian distribution is the iFisheH (|1953D distribution. 



Pf{iP\h) 



2 sinh K 



exp(K cos Ip) sin -0, 



(27) 



where k is the concentration parameter, a measure of 
the concentration of the probability distribution around 
^ = 0. For K = 0, the distribution becomes the isotropic 
distribution pi{ip) — ^ sin-^. For k ^ 1 and ■0 ~* 0, the 
distribution becomes a Rayleigh distribution 



PR(0k) = -4exp 



2^ 



(28) 



with a width parameter a — kT^I"^ . Mathematical prop- 
erties of the Fisher distributio n, and it s relat ion to other 
distributions, can be found in IWatsonl ()1982^. Some ex- 
amples of Fisher distributions are plotted in Figure [5l 

Let us assume that the spin-orbit angles are drawn 
from a Fisher distribution, and derive the probability 
distribution for k conditioned on the data. We choose 
a prior distribution p(k) oc (1 -|- k^)~'^/^. This has the 
desirable limits p(k) constant for k ^ (for broad dis- 
tributions it is uninformative in k), and p((t) constant 
for K — > oo (for narrow, Rayleigh-like distributions it is 
uninformative in a = k^^/^). The particular values of 
K for which the distributions are illustrated in Figure [5] 
were chosen because each interval in k is equally likely, 
according to our prior. 
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Fig. 6. — Modeling the RM ensemble with a Fisher distribution, 
(a) The line with dots shows the probability of the concentration 
parameter ft, conditioned on the data. The dots show the specific 
values of ft for which we computed the posterior probability. The 
line with I's shows the assumed prior distribution of ft. The a;'s 
show the values of ft for which the Fisher distributions are illus- 
trated in Figure [5] (b) Correspondence with the more familiar 
Rayleigh distribution (eq. 1281 ). using the equivalence a = k~^I'^ 
that is motivated by the high-ft, low-tr limit. Lines and points 
have the same meaning as in panel (a). In neither panel are the 
prior distributions normalized. The posterior distributions are nor- 
malized by t he "evidence" for the Fisher distribution model (see 
equation [3Dl . 



The probability density for each value of f«, conditioned 
on the data, is the product of the prior and the probabil- 
ity density of the data given the value of k, according to 
equation (j26|) . Figure [D^a) shows the resulting function 
p(K|data), based on the 11 available RM measurements. 
It has been suitably normalized to unit probability. [The 
prior is also displayed in Fig. [6l^a), for reference.] 
Based on this result, we find that the characteristic con- 
centration parameter is k > 7.6 with 95% confidence. 

The results for ft can be converted to a characteristic 
angular dispersion a (in degrees) using a = bear- 
ing in mind that the Fisher distribution with concentra- 
tion parameter k is the same as the Rayleigh distribution 
with width parameter in the limit of f« ^ oo. The 

resulting distributions pF(CT|data) and p((t) are plotted 
in Figure inijb). The width parameter a is less than 22° 
with 95% confidence. 

We now examine the sensitivity of these results to cer- 
tain aspects of the input data. First, we recompute the 
results without including the XO-3 data. This is because 
the finding of a strong misalignment in that system was 
considered tentative by the observers themselves. A stan- 
dard RM model does not provide a statisticall y accept- 
able fit to the XO-3 data ( Hebrard et al.ll2008f ). It may 
be relevant that some of the data were contaminated by 
bright moonlight, requiring significant corrections to be 
applied to the spectra, and some of the data were taken 
at very high air masses. 

Figure [7] show the results when XO-3 is ignored and 
the other 10 systems are included as before. The results 
are very different: the distribution is tightly constrained 



all systems 
wittiout X0-3b 
'ithout HD 189733b 




35 



Fig. 7. — The same as Figure|6lb), but restricting the analysis to 
10 systems instead of 11, by removing either XO-3 or HD 189733. 
The results are very different in those cases. Thus the results de- 
pend strongly on the few systems for which A is measured with 
the best precision, and the single system with an apparently large 
misalignment. The Solar value of t/> is shown for reference. 



near zero: tr < 7° with 95% confidence. That these ten 
systems are consistent with perfect alignment (cr = 0) 
can be seen by computing — ^^(•^/'''a)^ from Table [TJ 
it is 11.7, with 10 degrees of freedom. Therefore, apart 
from XO-3, the non-zero values of A are consistent with 
observational errors alone. The p((t) distribution maxi- 
mum likelihood is at non-zero a because the reduced ^ 
is greater than 1, but this departure from cr = is not 
statistically significant. 

Second, we investigate how sensitive are the results to 
the few most precise RM measurements. The dashed 
line in Fig. [7] show the results if we ignore HD 189733, 
the system with the smallest error bar cta, and include 
all 10 of the remaining systems. In this case, the XO-3 
result has enough statistical weight to pull the peak of 
the distribution strongly away from zero. Also plotted 
for reference is ^0, the Solar spin-orbit angle. It must be 
remembered, though, that i/i© is a particular spin-orbit 
angle while a describes the dispersion of V'- The Solar 
value of "00 is typical of the "allowed" spin-orbit angles 
of hot Jupiters with their host stars. 

With an eye towards future statistical analyses using 
the Fisher distribution, we note that the most computa- 
tionally challenging aspect of the analysis was performing 
the integral ([23]) . A major simplification is available for 
nearly edge-on orbits, if the data are already known to 
favor a highly concentrated distribution («; ^ 1). In this 
case the hypothesized distribution for ?/' is a Rayleigh 
distribution of parameter c <C 1, and the distribution of 
A is Gaussian with a standard deviation a. The prob- 
lem of constraining the distribution of ?/; is reduced to 
the problem of determining the true standard deviation 
of a distribution from which several noisy data points 
have been drawn. The dispersion a can be estimated 
by finding the value of a that gives = when it is 
added in quadrature with the measurement errors. The 
ten systems besides XO-3 are in this simple regime. The 
simplified procedure described in this paragraph gives 
a — 1.1°, in agreement with the maximum likelihood 
value given for those ten systems in Figure ([7]). 
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f = fraction of isotropic systems 



Fig. 8. — Modeling the RM ensemble with the sum of an isotropic 
distribution and a perfectly-aligned (delta-function) distribution. 
A fraction / of the systems are drawn from the isotropic distri- 
bution, and the remaining fraction (1 — /) are drawn from the 
delta-function distribution. Plotted is the posterior probability 
distribution for /, given the data. We find / < 0.36 with 95% 
confidence. 



4.2. A sum of two distributions: isotropic and 
perfectly-aligned 

An alternative and equally simple way to describe the 
data is to suppose that all systems are drawn either from 
an isotropic distribution (with probability /) or from a 
very well-aligned distribution (with probability 1 — /). 
We further suppose that the well-aligned distribution is 
sufficiently concentrated around ijj — that none of the 
current measurements would be able to distinguish it 
from a delta function. This toy model will be a use- 
ful baseline for limiting the fraction of planets that mi- 
grate by various channels, some of which yield a nearly 
isotropic distribution of spin-orbit angles (specific exam- 
ples are cited in fjs]). 

The probability of the data (Afe, ax.k) given such a 
model is: 

(29) 

The factor of 2 in the numerator of the Gaussian por- 
tion above arises from the ±A degeneracy mentioned in 
121 Both of the terms in the sum are independent of io- 
Adopting a uniform prior for /, and using Bayes' theo- 
rem, we plot the result for p(/|data) in Figure [51 The 
data demand that / < 0.36 with 95% confidence. The 
favored value of / is 0.1, implying that approximately 
one system out of 11 is drawn from the isotropic distri- 
bution, clear indication that this result is being driven by 
XO-3. If we remove XO-3 from the analysis (to check the 
sensitivity of the analysis to this one system), / < 0.25 
at 95% confidence, and the maximum likelihood is / = 
exactly. 

4.3. A sum of two Fisher distributions 

Another possible model is a sum of two Fisher distribu- 
tions with different concentration parameters. This could 
describe, for instance, two different channels by which 
giant planets migrate to close-in orbits, which produce 
differe nt final distributions of spin-orbit angles. iBrownl 
(|2001) has found that a sum of two Fisher-like distri- 
butions is needed to fit the inclination distribution of 
Kuiper belt objects, and this multi-component model 



has been a very useful constraint on dynamical theo- 
ries. In this "two-Fisher" model, a fraction / of systems 
are Fisher-distributed with concentration parameter ki, 
and the remaining fraction (1 — /) of systems are Fisher- 
distributed with concentration parameter K2- Thus the 
two-Fisher model has three free parameters. The results 
of the previous section correspond to the case / « 0.1, 

Kl ~ 0, K2 —^ OO. 

To determine the joint posterior probability distribu- 
tion for /, Ki, and K2, we computed 

P2f(/, Ki,K2|data)=p(/)p(Ki)p(K2) X (30) 
/pi?(data|Ki) + (1 — /)p_F(data|K2) . 

The difficult integrals implied by this equation were al- 
ready computed for the single-Fisher model. The re- 
sults are / = O.lOt^J;^^, ki = 0.22tJ:°^, and K2 = 
110^yg°. The ffi'st Fisher distribution is consistent with 
an isotropic distribution. The second Fisher distribution 
is approximately equivalent to a Rayleigh distribution 
with width parameter a2 = 5.5]t2'4 deg. The proba- 
bility contours show that the correlations between the 
parameters are relatively small, especially within the 1-a 
preferred region. 

4.4. Choosing among different models 

The results of the 3-parameter, two-Fisher model 
are consistent with the results of the single-parameter, 
isotropic -I- perfectly-aligned model given in the previ- 
ous section. The greater complexity of the two-Fisher 
model makes it less appealing. This loss of appeal can 
be quantified within a Bayesian framework, which has a 
quantitative expression of Occam's razor. ^ Each model 
has an associated "evidence," 

E ^ J p(data|a)p(a)da, (31) 

where a is the vector of model parameters. This is the 
normalization factor used in Bayes' theorem, which turns 
proportionality ([26| into an equation. The presence of 
the first factor, p(data|a), is the quantitative expression 
of the intuition that the data may be taken as evidence 
for the model only when the model predicts the data are 
probable. The second factor, p{a), assigns greater evi- 
dence to models that concentrate their predictive power 
in the region where the data are found. This is because 
the integral over the prior distribution p(a) is normal- 
ized to unity; if a prior distribution is spread too thinly 
over the parameter space of a, then it cannot give much 
weight to models that are consistent with the data. 

We computed the evidence for the models described in 
the three previous sections. The single-Fisher model has 
E = 14.4, the isotropic 4" perfectly-aligned model has 
E = 1927, and the two-Fisher model has E = 726. The 
model that mixes the two extreme distributions does the 
best. It is favored by a factor of ^ 130 over the single- 
Fisher model. 

The difficulty with the single-Fisher model is that it 
cannot simultaneously account for the majority of sys- 
tems that are well-aligned while also including XO-3. A 

^ A lucid d iscussion of Bayesian model choosing is given by 
IMacKavl[2003. ch. 28. 
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small value of k makes XO-3 probable, but the 10 other 
systems are somewhat less probable, and the multiplica- 
tion of these 10 lessened probabilities according to equa- 
tion (|25p means there is little evidence for small k. A 
large value of k makes most of the data probable, but 
then the XO-3 result is very improbable, and the result 
is poor evidence for large k. The distribution that mixes 
an isotropic distribution and a perfectly-aligned distri- 
bution overcomes this difficulty by allowing both the ma- 
jority of systems and XO-3 to be reasonably probable. 
The only free parameter is /, and the constraints on / 
are loose. Thus, the data provide substantial evidence 
for a large fraction of the parameter space. 

The poor showing of the two-Fisher model relative to 
the isotropic -I- perfectly-aligned model indicates that the 
two-Fisher model is a needless complication. The ex- 
tra two parameters open up two more dimensions in the 
model's parameter space. As a result, much of the prior 
probability is "wasted" on regions of parameter space 
that are ruled out by the data. We hope that some day 
there will be enough high-quality RM data to justify a 
more complicated model such as the two-Fisher model. 
In this regard we note that 379 Kuiper belt objects were 
discovered bef ore their incl ination distribution was mod- 
eled this way (lBrownll200ll ). 

We have already shown that the XO-3 measurement 
has an especially strong influence on the results. Unfor- 
tunately, as mentoned earlier, this measurement is con- 
sidered suspect because of the possibility of systematic 
errors. Following the referee's suggestion, we may bring 
this suspicion under the umbrella of the Bayesian analy- 
sis by including a "degree of belief" parameter pi- , which 
gives the a priori probability that the XO-3 measure- 
ment will prove to be correct. Since the nature of the 
systematic errors (if any) is not known, is rather sub- 
jective and open to debate. Our goal is not to determine 
the value of pr, but rather to ask what is the minimum 
value of Pr that is required for our conclusion to hold that 
the isotropic -I- perfectly-ahgned model is preferred. 

Each of the three models — single-Fisher, perfectly- 
aligned plus an isotropic fraction, and two-Fisher — is fit- 
ted to the 11-mcmber ensemble including XO-3, and also 
fitted to the 10-member ensemble excluding XO-3. Then 
the evidence for each model is computed as a weighted 
sum, with weight p^ applied to the 11-member set and 
weight (1— Pi ) to the 10-member set. The evidence values 
of the models are computed and compared as a function 
of Pr- The result is that for pr < 0.95, all three models fit 
the data equally well; they have evidence values within 
a factor of 3 of each other. We conclude that unless one 
has >95% confidence that the XO-3 result is robust, then 
a single smooth distribution of -0 is a perfectly viable de- 
scription of the ensemble. 

5. DISCUSSION 

The angle between and Mq is a fundamental geomet- 
ric property of exoplanetary systems. A good theory of 
planet formation, migration, and evolution ought to pre- 
dict the statistical relationship between iig and rio for hot 
Jupiters. This relationship is potentially measurable via 
RM observations. In this paper we have overcome an in- 
herent limitation of RM observations — they are sensitive 
only to the angle between the sky projections of the or- 
bital axis and the stellar rotation axis — by showing how 



to analyze the whole ensemble in a Bayesian framework. 

We modeled the 11 published measurements using a 
Fisher distribution, and found that the concentration pa- 
rameter K > 7.6 with 95% confidence. In this limit of a 
rather concentrated distribution, the Fisher distribution 
is equivalent to a Rayleigh distribution with width pa- 
rameter (T — n^^/^. Based on the 11 data points, a < 22° 
with 95% confidence. For comparison, the Solar obliquity 
is 7°. If we set aside XO-3 (for instance, if the "tenta- 
tive" detection of a strong misalignment is contradicted 
by higher-precision data), then the width parameter is 
< 6.6° with 95% confidence. In that case the hot Jupiters 
are just as well-aligned as the Solar system. 

The 11 data points also provide statistical evidence 
for two distinct populations within the ensemble, which 
might be interpreted as two different migration channels. 
Specifically, a model in which the systems are drawn 
from the sum of isotropic and perfectly-aligned distri- 
butions fits the data better than a model with a single 
smooth distribution (§ 14. 4p . This is a reasonable con- 
clusion, given that XO-3 shows the only evidence for a 
strong misalignment. However, due to the projection ef- 
fect in converting from ip to A, it was not obvious prior 
to our analysis that the XO-3 result cannot be accomo- 
dated as part of the "tail" in a smooth distribution of 
spin-orbit angles. In fact the data do not imply that 
XO-3 is the only system in the ensemble that is likely 
to be drawn from an isotropic distribution. We conclude 
only that fewer than 36% of the systems are drawn from 
an isotropic distribution, with 95% confidence. It is pos- 
sible that several members of the ensemble were drawn 
from an isotropic distribution. 

There is plenty of room for improvement in the quan- 
tity and quality of the RM data that are needed for this 
type of study. We have found that the present data are 
sufficient only to constrain single-parameter models for 
the ensemble. We also showed that the current results are 
highly sensitive to the few systems with the finest mea- 
surement precision. Measurements of A with a precision 
of a few degrees are still in high demand. The "tenta- 
tive" result for XO-3 needs to be followed up with more 
definitive data, as that single result weighs heavily on the 
Bayesian calculation. In addition, a uniform analysis of 
the data across many systems would be useful. Among 
the nonuniform aspects of the data analyses are whether 
or not V sin i was treated as a free parameter or subject 
to a constraint based on the observed line-broadening; 
whether or not the uncertainties in the photometric tran- 
sit parameters were taken into account; whether or not 
correlated noise in the radial-velocity data was assessed 
and taken into account if necessary; and whether or not 
the effects of spectral deconvolution and cross-correlation 
algorithms were calibrated. (These algorithms need cal- 
ibration because they are generally intended to measure 
Doppler shifts, rather than model the actual RM spec- 
tral distortion that only superficially resembles a Doppler 
shift.) 

There is also a potential bias regarding which systems 
are sele cted for mea s ureme nt of the RM effect, as pointed 
out by I Winn et al.l ()2008 f) . Stars with low v sin i tend 
to be avoided, as the radial velocity anomaly would be 
small and the achievable precision in A is comparatively 
poor. However, stars with low usini^ are more likely to 
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be viewed pole-on and therefore have a large ^p. In the 
present work we have not attempted to correct for such 
a selection effect. 

Despite the limitations of the current data, 
the relative success of the two-component model 
(isotropic -t- perfectly-aligned) leads us to speculate on 
the implications for theories that attempt to explain the 
presence of hot Jupiters. The chain of logic begins with 
the assumption that the system begins very well-aligned 
(ip 0°). A natural prediction of in-situ formation 
theories, or theories involving migration due to torques 
from the protoplanetary disk, is that the orbit of the 
planet remains very well-aligned with equatorial plane 
of the star. (However, in the latter case the planet 
could conceivably misalign with the protoplanetary 
disk, and thus the stellar spin, depending on which 
resonant an d secular torques dominate the planet-disk 
interaction; iBorderies et all Il984t IWard fc Hah^ 11994 
ILubow fc Qgilvid 120011 ) " In contrast, a very broad ■tp 
dist ributio n can be produced by mechanisms involv- 
ing iKozail (|1962f) eccentricity cycles due to a distant 
companion star (|Fabrvckv fc Xremaind [20071: IWu et all 
|2007| ) . A nearly isotropic t p distribution can be pro duced 
by dynamical relaxation (iPapaloizou fc T crquem l2001h 
or planet-planet scattering ([Narita et al.l 120081) . It is 
possible that the two components in our statistical model 
correspond to two different channels for migration, one 
that preserves the initial spin-orbit alignment and one 
that randomizes spin-orbit alignment to a significant 
degree. 

A possible confounding factor is tidal damping. Based 
on the currently observed system parameters, it is ex- 
pected that tidal coplanarization (also called "inclination 
damping") is not a major influence on tp (jWinn et al.l 
|2005[) . but this is not a watertight argument. The 
timescales of long-term tidal processes are poorly known. 
Nevertheless, if tides raised on the star are large enough 
for substantial coplanarization, then the planet is in 
imminent danger o f spiraling in and being engulfed 
(jLevrard et al.l [20090 . Moreover, one would think that 
tidal damping of tp by dissipation in the star should be 
fastest for systems with the most massive planets (see, 

''' There is also dissipation within the planet, which is less efficient 
for higher mass planets. However, this mode of dissipation is less 
relevant because the torque on the planetary bulge does not couple 



e.g., [Fabrvckv et"al] [2007t [Jackson et an[2OO80 . Hence 
it is intriguing that X0-3b, the most massive transiting 
planet with an RM measurement, and thus the one for 
which tidal dissipation should have been the most impor- 
tant, is the only system showing evidence for misalign- 
ment.^ Therefore the misalignment of X0-3b suggests 
that tidal damping is not responsible for low ip values, 
and that the observed low |A| values should be inter- 
preted as a relic of the planet formation and migration 
processes. 

In this paper we have been concerned with constrain- 
ing the parameters of simple and fairly generic mathe- 
matical models for the distribution of spin-orbit angles. 
A priority for future work is to use the Bayesian frame- 
work developed in this paper to constrain the parame- 
ters of realistic, physically motivated models, based on 
the specific predictions of migration theories. In this vein 
we encourage theorists to make quantitative predictions 
about the distribution of tp. After deriving a theoreti- 
cal distribution for ip by whatever means, one may find 
the corresponding A distribution for edge-on orbits using 
equation (|19[) . This requires a convolution similar to that 
of equation The theoretical predictions may then 

be directly compared with the data. With RM measure- 
ments of sufficient quantity and quality and with theories 
of sufhcient specificity, it may be possible to rule out cer- 
tain migration theories, or to derive the likely fraction of 
systems that migrated through different channels. 
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